Introduction

1. Research Questions & Motivation

Project Overview

This interactive dashboard provides a comprehensive exploratory analysis of Early-Onset Colorectal Cancer (EOCRC) using data from the Surveillance, Epidemiology, and End Results (SEER) program. The dataset includes demographic, clinical, and socioeconomic variables linked to colorectal cancer diagnoses between 2018 and 2021. The goal is to enable data-driven decision-making for researchers and policymakers.

This study investigates disparities in Early-Onset Colorectal Cancer (EOCRC) and Late-Onset Colorectal Cancer (LOCRC), focusing on demographic, socioeconomic, and geographic factors.

Key questions:

  • How do age, sex, and race/ethnicity influence stage at diagnosis?
  • What is the relationship between household income and stage of EOCRC?
  • How do geographic disparities impact access and diagnosis?
  • What clinical factors affect EOCRC development in adults aged 18–49?

2. Objectives

  • Analyze demographic patterns in EOCRC cases.
  • Explore disparities across race, sex, and socioeconomic status.
  • Investigate urban vs. rural classification and its impact on cancer diagnosis.
  • Enable interactive data filtering to facilitate real-time analysis and insights.

Data Source

Data Source

  • The data source is the SEER Program (https://seer.cancer.gov/), a trusted U.S. cancer surveillance system that requires an account to access the data. There were 144,788 cases (Early-Onset Colorectal Cancer (EOCRC) ages 18-49, Late-Onset Colorectal Cancer (LOCRC): ages 50+). The data were collected using hospital-based cancer registries, pathology reports, and medical records. The study population consisted of diverse EOCRC and LOCRC patients across multiple geographic regions in the U.S. The time period of the data was 2018-2021, capturing recent trends in cancer disparities.

Project Description

EOCRC Visualization Bar Chart Description

  1. Demographics
    • Age Groups: 18-49 Early-Onset Colorectal Cancer (EOCRC), 50+ Late-Onset Colorectal Cancer (LOCRC)
    • Sex: Male, Female
    • Race/Ethnicity:
      • Non-Hispanic White
      • Non-Hispanic Black
      • Hispanic
      • Non-Hispanic Asian/Pacific Islander
      • Non-Hispanic American Indian/Alaska Native
  2. Cancer Details
    • Histology: Type of cancer cells
    • Stage at Diagnosis: Localized, regional, distant
  3. Socioeconomic Factors
    • Household Income: Inflation-adjusted to 2022 (categorized)
    • Rural vs. Urban: Metropolitan or nonmetropolitan areas
  4. Diagnosis & Clinical Data
    • Year of Diagnosis: 2018-2021
    • SEER Registry: SEER registry (with CA and GA as whole states)
  5. Interactive Bar Chart (Plotly - EOCRC Cases by Race/Ethnicity)
    • X-Axis: Race/Ethnicity categories
    • Y-Axis: Number of Early-Onset Colorectal Cancer (EOCRC)/Late-Onset Colorectal Cancer (LOCRC) cases
    • Format: Stacked bar chart
  6. Interactive Features
    • Hover Tooltips: Displays exact case counts for each racial/ethnic group.
    • Legend Toggle: Allows users to filter selected racial/ethnic groups.
    • Zooming & Panning: Enables in-depth analysis of disparities.
  7. Purpose
    • Identify racial and ethnic disparities in colorectal cancer incidence.
    • Analyze potential risk factors for different populations.
    • Assess healthcare access inequalities affecting early detection.

Filterable Data Table (DT) with Description

  1. Filterable Data Table (DT) – DescriptionPatient Demographics & Clinical Characteristics searchable & sortable table for patient demographics and clinical data.

    • X-Axis: Race/Ethnicity categories
    • Y-Axis: Number of Early-Onset Colorectal Cancer (EOCRC)/Late-Onset Colorectal Cancer (LOCRC) cases
    • Format: Stacked bar chart
  2. Interactive Features

    • Dropdown Filters: Age, sex, race/ethnicity, tumor stage, income level, SEER Registry, Rural-Urban Classification (Metro/Non-Metro, Early-Onset Cancer (18-49) (Yes/No), Combined Summary Stage (2004+)
    • Search Functionality: Quick lookup of patient records.
    • Sortable Columns: Diagnose trends over time.
    • Pagination & Adjustable Display: Customizable data views.
  3. Purpose

    • Compare EOCRC vs. LOCRC patient characteristics.
    • Identify patterns in cancer progression.
    • Support research & policy interventions for cancer prevention.

EOCRC Visualization Bar Chart

Row

Column

Interactive Bar Chart (Plotly - Early-Onset Colorectal Cancer (EOCRC) Cases by Race/Ethnicity)

Column

Key takeaways description

Key takeaways for bar graph: The majority of colorectal cancer cases occurred in people aged 50+. Non-Hispanic Whites made up more than 50% of all CRC cases, while Non-Hispanic American Indians/Alaska Natives made up less than 1% of all Colorectal Cancer (CRC) cases.

Filterable Data Table (DT)

Row

Column

Filterable Data Table (DT) – Patient Demographics & Clinical Characteristics

Column

Key takeaways description

Key takeaways from filterable data table: Approximately 47% of all colorectal cancer cases were from females and 53% were from males. 126,003 cases were LOCRC, while 18,785 cases were EOCRC. 2020 had the lowest number of CRC cases with 33,582 cases.

Real world Impact